Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 2398116 |
| Missing cells | 1427778 |
| Missing cells (%) | 4.3% |
| Duplicate rows | 355 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 256.1 MiB |
| Average record size in memory | 112.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 3 |
| Dataset has 355 (< 0.1%) duplicate rows | Duplicates |
mother_body_mass_index is highly overall correlated with mother_delivery_weight | High correlation |
mother_delivery_weight is highly overall correlated with mother_body_mass_index | High correlation |
father_education is highly overall correlated with mother_marital_status | High correlation |
mother_marital_status is highly overall correlated with father_education | High correlation |
previous_cesarean is highly imbalanced (60.0%) | Imbalance |
mother_body_mass_index has 146600 (6.1%) missing values | Missing |
mother_marital_status has 412510 (17.2%) missing values | Missing |
mother_delivery_weight has 34958 (1.5%) missing values | Missing |
mother_height has 244529 (10.2%) missing values | Missing |
mother_weight_gain has 73473 (3.1%) missing values | Missing |
father_age has 444506 (18.5%) missing values | Missing |
number_prenatal_visits has 59901 (2.5%) missing values | Missing |
mother_weight_gain has 68723 (2.9%) zeros | Zeros |
cigarettes_before_pregnancy has 2186624 (91.2%) zeros | Zeros |
prenatal_care_month has 40409 (1.7%) zeros | Zeros |
number_prenatal_visits has 40409 (1.7%) zeros | Zeros |
Reproduction
| Analysis started | 2023-05-09 15:03:41.819591 |
|---|---|
| Analysis finished | 2023-05-09 15:06:05.082119 |
| Duration | 2 minutes and 23.26 seconds |
| Software version | ydata-profiling vv4.1.2 |
| Download configuration | config.json |
mother_body_mass_index
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 561 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 146600 |
| Missing (%) | 6.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.16721 |
| Minimum | 13 |
|---|---|
| Maximum | 69.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 22.3 |
| median | 25.7 |
| Q3 | 30.7 |
| 95-th percentile | 40.3 |
| Maximum | 69.8 |
| Range | 56.8 |
| Interquartile range (IQR) | 8.4 |
Descriptive statistics
| Standard deviation | 6.7557576 |
|---|---|
| Coefficient of variation (CV) | 0.24867322 |
| Kurtosis | 1.7558702 |
| Mean | 27.16721 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.1826656 |
| Sum | 61167408 |
| Variance | 45.640261 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.6 | 41362 | 1.7% |
| 28.3 | 36430 | 1.5% |
| 23 | 29869 | 1.2% |
| 22.3 | 28077 | 1.2% |
| 25.8 | 27046 | 1.1% |
| 27.4 | 26855 | 1.1% |
| 21.3 | 25838 | 1.1% |
| 21.9 | 25337 | 1.1% |
| 21.6 | 25100 | 1.0% |
| 25.7 | 25015 | 1.0% |
| Other values (551) | 1960587 | |
| (Missing) | 146600 | 6.1% |
| Value | Count | Frequency (%) |
| 13 | 8 | < 0.1% |
| 13.1 | 14 | < 0.1% |
| 13.2 | 21 | < 0.1% |
| 13.3 | 28 | |
| 13.4 | 26 | |
| 13.5 | 29 | |
| 13.6 | 33 | |
| 13.7 | 63 | |
| 13.8 | 27 | |
| 13.9 | 48 |
| Value | Count | Frequency (%) |
| 69.8 | 2 | < 0.1% |
| 69.7 | 2 | < 0.1% |
| 69.5 | 3 | < 0.1% |
| 69.4 | 1 | < 0.1% |
| 69.3 | 2 | < 0.1% |
| 69.1 | 5 | < 0.1% |
| 68.9 | 1 | < 0.1% |
| 68.8 | 3 | < 0.1% |
| 68.7 | 29 | |
| 68.6 | 1 | < 0.1% |
mother_marital_status
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 412510 |
| Missing (%) | 17.2% |
| Memory size | 18.3 MiB |
| 1.0 | |
|---|---|
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 5956818 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1192238 | |
| 2.0 | 793368 | |
| (Missing) | 412510 | 17.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 1192238 | |
| 2.0 | 793368 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1985606 | |
| 0 | 1985606 | |
| 1 | 1192238 | |
| 2 | 793368 | 13.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3971212 | |
| Other Punctuation | 1985606 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1985606 | |
| 1 | 1192238 | |
| 2 | 793368 | 20.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1985606 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5956818 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1985606 | |
| 0 | 1985606 | |
| 1 | 1192238 | |
| 2 | 793368 | 13.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5956818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 1985606 | |
| 0 | 1985606 | |
| 1 | 1192238 | |
| 2 | 793368 | 13.3% |
mother_delivery_weight
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 301 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34958 |
| Missing (%) | 1.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 188.31698 |
| Minimum | 100 |
|---|---|
| Maximum | 400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 135 |
| Q1 | 159 |
| median | 181 |
| Q3 | 210 |
| 95-th percentile | 267 |
| Maximum | 400 |
| Range | 300 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 41.369241 |
|---|---|
| Coefficient of variation (CV) | 0.21967876 |
| Kurtosis | 1.6461613 |
| Mean | 188.31698 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 1.0477953 |
| Sum | 4.4502278 × 108 |
| Variance | 1711.4141 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 160 | 47979 | 2.0% |
| 180 | 47202 | 2.0% |
| 170 | 44111 | 1.8% |
| 150 | 40506 | 1.7% |
| 165 | 37313 | 1.6% |
| 190 | 35472 | 1.5% |
| 200 | 34388 | 1.4% |
| 175 | 33954 | 1.4% |
| 185 | 31250 | 1.3% |
| 155 | 28049 | 1.2% |
| Other values (291) | 1982934 | |
| (Missing) | 34958 | 1.5% |
| Value | Count | Frequency (%) |
| 100 | 1147 | |
| 101 | 184 | < 0.1% |
| 102 | 247 | < 0.1% |
| 103 | 285 | < 0.1% |
| 104 | 272 | < 0.1% |
| 105 | 556 | |
| 106 | 427 | < 0.1% |
| 107 | 463 | |
| 108 | 662 | |
| 109 | 560 |
| Value | Count | Frequency (%) |
| 400 | 1209 | |
| 399 | 47 | < 0.1% |
| 398 | 67 | < 0.1% |
| 397 | 41 | < 0.1% |
| 396 | 54 | < 0.1% |
| 395 | 56 | < 0.1% |
| 394 | 38 | < 0.1% |
| 393 | 47 | < 0.1% |
| 392 | 55 | < 0.1% |
| 391 | 49 | < 0.1% |
mother_race
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5223421 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.1115543 |
|---|---|
| Coefficient of variation (CV) | 0.73016062 |
| Kurtosis | 5.978017 |
| Mean | 1.5223421 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.5217597 |
| Sum | 3650753 |
| Variance | 1.2355529 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1764845 | |
| 2 | 379173 | 15.8% |
| 4 | 159734 | 6.7% |
| 6 | 63288 | 2.6% |
| 3 | 23241 | 1.0% |
| 5 | 7835 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 1764845 | |
| 2 | 379173 | 15.8% |
| 3 | 23241 | 1.0% |
| 4 | 159734 | 6.7% |
| 5 | 7835 | 0.3% |
| 6 | 63288 | 2.6% |
| Value | Count | Frequency (%) |
| 6 | 63288 | 2.6% |
| 5 | 7835 | 0.3% |
| 4 | 159734 | 6.7% |
| 3 | 23241 | 1.0% |
| 2 | 379173 | 15.8% |
| 1 | 1764845 |
mother_height
Real number (ℝ)
| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 244529 |
| Missing (%) | 10.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.121252 |
| Minimum | 30 |
|---|---|
| Maximum | 78 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 62 |
| median | 64 |
| Q3 | 66 |
| 95-th percentile | 69 |
| Maximum | 78 |
| Range | 48 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8355252 |
|---|---|
| Coefficient of variation (CV) | 0.044221301 |
| Kurtosis | 0.79989766 |
| Mean | 64.121252 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.080771473 |
| Sum | 1.380907 × 108 |
| Variance | 8.0402031 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 64 | 319699 | |
| 63 | 281534 | |
| 62 | 273071 | |
| 65 | 257357 | |
| 66 | 237395 | |
| 67 | 187786 | |
| 61 | 155167 | |
| 60 | 119183 | 5.0% |
| 68 | 103323 | 4.3% |
| 69 | 69668 | 2.9% |
| Other values (36) | 149404 | |
| (Missing) | 244529 |
| Value | Count | Frequency (%) |
| 30 | 10 | |
| 31 | 2 | < 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 36 | 7 | |
| 37 | 3 | < 0.1% |
| 38 | 3 | < 0.1% |
| 39 | 7 | |
| 40 | 2 | < 0.1% |
| 41 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 78 | 438 | < 0.1% |
| 77 | 266 | < 0.1% |
| 76 | 269 | < 0.1% |
| 75 | 513 | < 0.1% |
| 74 | 1401 | 0.1% |
| 73 | 2774 | 0.1% |
| 72 | 8406 | 0.4% |
| 71 | 18487 | 0.8% |
| 70 | 34409 | |
| 69 | 69668 |
mother_weight_gain
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 99 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73473 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.483728 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 68723 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 20 |
| median | 29 |
| Q3 | 38 |
| 95-th percentile | 55 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 15.146299 |
|---|---|
| Coefficient of variation (CV) | 0.51371723 |
| Kurtosis | 1.0413842 |
| Mean | 29.483728 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.54174921 |
| Sum | 68539142 |
| Variance | 229.41038 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 110254 | 4.6% |
| 20 | 84648 | 3.5% |
| 25 | 80517 | 3.4% |
| 35 | 73636 | 3.1% |
| 0 | 68723 | 2.9% |
| 40 | 67846 | 2.8% |
| 28 | 64233 | 2.7% |
| 27 | 61887 | 2.6% |
| 32 | 61150 | 2.5% |
| 33 | 59945 | 2.5% |
| Other values (89) | 1591804 | |
| (Missing) | 73473 | 3.1% |
| Value | Count | Frequency (%) |
| 0 | 68723 | |
| 1 | 8937 | 0.4% |
| 2 | 10713 | 0.4% |
| 3 | 11199 | 0.5% |
| 4 | 12452 | 0.5% |
| 5 | 16264 | 0.7% |
| 6 | 15585 | 0.6% |
| 7 | 16925 | 0.7% |
| 8 | 19003 | 0.8% |
| 9 | 19008 | 0.8% |
| Value | Count | Frequency (%) |
| 98 | 3513 | |
| 97 | 181 | < 0.1% |
| 96 | 215 | < 0.1% |
| 95 | 320 | < 0.1% |
| 94 | 232 | < 0.1% |
| 93 | 228 | < 0.1% |
| 92 | 265 | < 0.1% |
| 91 | 251 | < 0.1% |
| 90 | 725 | < 0.1% |
| 89 | 336 | < 0.1% |
father_age
Real number (ℝ)
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 444506 |
| Missing (%) | 18.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.801093 |
| Minimum | 11 |
|---|---|
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 27 |
| median | 31 |
| Q3 | 36 |
| 95-th percentile | 44 |
| Maximum | 98 |
| Range | 87 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 6.8126466 |
|---|---|
| Coefficient of variation (CV) | 0.21422681 |
| Kurtosis | 0.90039787 |
| Mean | 31.801093 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.55440606 |
| Sum | 62126933 |
| Variance | 46.412153 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32 | 120181 | 5.0% |
| 31 | 118844 | 5.0% |
| 33 | 116729 | 4.9% |
| 30 | 116504 | 4.9% |
| 29 | 109740 | 4.6% |
| 34 | 109318 | 4.6% |
| 28 | 103843 | 4.3% |
| 35 | 101883 | 4.2% |
| 27 | 95096 | 4.0% |
| 36 | 91339 | 3.8% |
| Other values (68) | 870133 | |
| (Missing) | 444506 |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 13 | 12 | < 0.1% |
| 14 | 101 | < 0.1% |
| 15 | 444 | < 0.1% |
| 16 | 1627 | 0.1% |
| 17 | 4628 | 0.2% |
| 18 | 10652 | 0.4% |
| 19 | 19721 | |
| 20 | 29402 |
| Value | Count | Frequency (%) |
| 98 | 1 | < 0.1% |
| 91 | 1 | < 0.1% |
| 88 | 1 | < 0.1% |
| 85 | 1 | < 0.1% |
| 84 | 2 | < 0.1% |
| 83 | 6 | |
| 82 | 1 | < 0.1% |
| 81 | 2 | < 0.1% |
| 80 | 2 | < 0.1% |
| 79 | 2 | < 0.1% |
father_education
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9042407 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.3058055 |
|---|---|
| Coefficient of variation (CV) | 0.47016565 |
| Kurtosis | -0.89317692 |
| Mean | 4.9042407 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.44140214 |
| Sum | 11760938 |
| Variance | 5.316739 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 619048 | |
| 6 | 412201 | |
| 4 | 386863 | |
| 9 | 329016 | |
| 2 | 186785 | 7.8% |
| 7 | 162676 | 6.8% |
| 5 | 151284 | 6.3% |
| 1 | 78382 | 3.3% |
| 8 | 71861 | 3.0% |
| Value | Count | Frequency (%) |
| 1 | 78382 | 3.3% |
| 2 | 186785 | 7.8% |
| 3 | 619048 | |
| 4 | 386863 | |
| 5 | 151284 | 6.3% |
| 6 | 412201 | |
| 7 | 162676 | 6.8% |
| 8 | 71861 | 3.0% |
| 9 | 329016 |
| Value | Count | Frequency (%) |
| 9 | 329016 | |
| 8 | 71861 | 3.0% |
| 7 | 162676 | 6.8% |
| 6 | 412201 | |
| 5 | 151284 | 6.3% |
| 4 | 386863 | |
| 3 | 619048 | |
| 2 | 186785 | 7.8% |
| 1 | 78382 | 3.3% |
cigarettes_before_pregnancy
Real number (ℝ)
| Distinct | 67 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11301 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1043881 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 2186624 |
| Zeros (%) | 91.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 10 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.7305202 |
|---|---|
| Coefficient of variation (CV) | 4.2833858 |
| Kurtosis | 74.378488 |
| Mean | 1.1043881 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.8454906 |
| Sum | 2635970 |
| Variance | 22.377821 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2186624 | |
| 20 | 60078 | 2.5% |
| 10 | 53333 | 2.2% |
| 5 | 21147 | 0.9% |
| 3 | 9887 | 0.4% |
| 2 | 7970 | 0.3% |
| 4 | 7551 | 0.3% |
| 1 | 6183 | 0.3% |
| 40 | 5996 | 0.3% |
| 6 | 5937 | 0.2% |
| Other values (57) | 22109 | 0.9% |
| (Missing) | 11301 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 2186624 | |
| 1 | 6183 | 0.3% |
| 2 | 7970 | 0.3% |
| 3 | 9887 | 0.4% |
| 4 | 7551 | 0.3% |
| 5 | 21147 | 0.9% |
| 6 | 5937 | 0.2% |
| 7 | 3680 | 0.2% |
| 8 | 3411 | 0.1% |
| 9 | 648 | < 0.1% |
| Value | Count | Frequency (%) |
| 98 | 439 | |
| 95 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
| 90 | 46 | < 0.1% |
| 88 | 6 | < 0.1% |
| 85 | 2 | < 0.1% |
| 84 | 2 | < 0.1% |
| 80 | 338 | |
| 76 | 1 | < 0.1% |
| 75 | 4 | < 0.1% |
prenatal_care_month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.2958756 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 40409 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 15.055082 |
|---|---|
| Coefficient of variation (CV) | 2.8427937 |
| Kurtosis | 34.39938 |
| Mean | 5.2958756 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.0014092 |
| Sum | 12700124 |
| Variance | 226.65549 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 941813 | |
| 3 | 730167 | |
| 4 | 210075 | 8.8% |
| 1 | 139262 | 5.8% |
| 5 | 105509 | 4.4% |
| 6 | 65178 | 2.7% |
| 99 | 59760 | 2.5% |
| 7 | 51778 | 2.2% |
| 0 | 40409 | 1.7% |
| 8 | 38619 | 1.6% |
| Other values (2) | 15546 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 40409 | 1.7% |
| 1 | 139262 | 5.8% |
| 2 | 941813 | |
| 3 | 730167 | |
| 4 | 210075 | 8.8% |
| 5 | 105509 | 4.4% |
| 6 | 65178 | 2.7% |
| 7 | 51778 | 2.2% |
| 8 | 38619 | 1.6% |
| 9 | 15276 | 0.6% |
| Value | Count | Frequency (%) |
| 99 | 59760 | 2.5% |
| 10 | 270 | < 0.1% |
| 9 | 15276 | 0.6% |
| 8 | 38619 | 1.6% |
| 7 | 51778 | 2.2% |
| 6 | 65178 | 2.7% |
| 5 | 105509 | 4.4% |
| 4 | 210075 | 8.8% |
| 3 | 730167 | |
| 2 | 941813 |
number_prenatal_visits
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 59901 |
| Missing (%) | 2.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.293179 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 40409 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 9 |
| median | 12 |
| Q3 | 13 |
| 95-th percentile | 18 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 4.197046 |
|---|---|
| Coefficient of variation (CV) | 0.37164435 |
| Kurtosis | 5.8723265 |
| Mean | 11.293179 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.6245808 |
| Sum | 26405880 |
| Variance | 17.615195 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 370513 | |
| 10 | 313608 | |
| 11 | 238126 | |
| 13 | 229898 | |
| 14 | 193076 | |
| 9 | 155579 | 6.5% |
| 15 | 146015 | 6.1% |
| 8 | 130915 | 5.5% |
| 7 | 82978 | 3.5% |
| 16 | 75910 | 3.2% |
| Other values (71) | 401597 |
| Value | Count | Frequency (%) |
| 0 | 40409 | 1.7% |
| 1 | 10725 | 0.4% |
| 2 | 17729 | 0.7% |
| 3 | 25167 | 1.0% |
| 4 | 34934 | 1.5% |
| 5 | 49750 | 2.1% |
| 6 | 66832 | |
| 7 | 82978 | |
| 8 | 130915 | |
| 9 | 155579 |
| Value | Count | Frequency (%) |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 90 | 2 | |
| 89 | 1 | < 0.1% |
| 84 | 3 | |
| 77 | 1 | < 0.1% |
| 76 | 3 | |
| 75 | 4 | |
| 74 | 1 | < 0.1% |
| 73 | 1 | < 0.1% |
previous_cesarean
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.3 MiB |
| N | |
|---|---|
| Y | |
| U | 1570 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2398116 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N |
|---|---|
| 2nd row | N |
| 3rd row | N |
| 4th row | N |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| N | 2020874 | |
| Y | 375672 | 15.7% |
| U | 1570 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| n | 2020874 | |
| y | 375672 | 15.7% |
| u | 1570 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2020874 | |
| Y | 375672 | 15.7% |
| U | 1570 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2398116 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2020874 | |
| Y | 375672 | 15.7% |
| U | 1570 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2398116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2020874 | |
| Y | 375672 | 15.7% |
| U | 1570 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2398116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2020874 | |
| Y | 375672 | 15.7% |
| U | 1570 | 0.1% |
newborn_gender
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 18.3 MiB |
| M | |
|---|---|
| F |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2398116 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | M |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| M | 1225891 | |
| F | 1172225 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 1225891 | |
| f | 1172225 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1225891 | |
| F | 1172225 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2398116 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1225891 | |
| F | 1172225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2398116 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1225891 | |
| F | 1172225 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2398116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1225891 | |
| F | 1172225 |
newborn_weight
Real number (ℝ)
| Distinct | 5195 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3261.8353 |
| Minimum | 227 |
|---|---|
| Maximum | 8165 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 18.3 MiB |
Quantile statistics
| Minimum | 227 |
|---|---|
| 5-th percentile | 2268 |
| Q1 | 2960 |
| median | 3300 |
| Q3 | 3629 |
| 95-th percentile | 4120 |
| Maximum | 8165 |
| Range | 7938 |
| Interquartile range (IQR) | 669 |
Descriptive statistics
| Standard deviation | 590.47237 |
|---|---|
| Coefficient of variation (CV) | 0.18102458 |
| Kurtosis | 2.7221499 |
| Mean | 3261.8353 |
| Median Absolute Deviation (MAD) | 330 |
| Skewness | -0.86348702 |
| Sum | 7.8222595 × 109 |
| Variance | 348657.62 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3260 | 26534 | 1.1% |
| 3430 | 25512 | 1.1% |
| 3090 | 23031 | 1.0% |
| 3600 | 22025 | 0.9% |
| 3345 | 20816 | 0.9% |
| 3175 | 20195 | 0.8% |
| 3515 | 18844 | 0.8% |
| 3402 | 18425 | 0.8% |
| 3289 | 18239 | 0.8% |
| 3374 | 17924 | 0.7% |
| Other values (5185) | 2186571 |
| Value | Count | Frequency (%) |
| 227 | 87 | |
| 228 | 2 | < 0.1% |
| 229 | 5 | < 0.1% |
| 230 | 18 | < 0.1% |
| 231 | 1 | < 0.1% |
| 232 | 8 | < 0.1% |
| 233 | 2 | < 0.1% |
| 235 | 12 | < 0.1% |
| 236 | 6 | < 0.1% |
| 237 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8165 | 7 | |
| 8160 | 1 | < 0.1% |
| 7975 | 1 | < 0.1% |
| 7940 | 2 | < 0.1% |
| 7815 | 1 | < 0.1% |
| 7757 | 1 | < 0.1% |
| 7730 | 1 | < 0.1% |
| 7710 | 1 | < 0.1% |
| 7626 | 1 | < 0.1% |
| 7352 | 1 | < 0.1% |
| mother_body_mass_index | mother_delivery_weight | mother_race | mother_height | mother_weight_gain | father_age | father_education | cigarettes_before_pregnancy | prenatal_care_month | number_prenatal_visits | newborn_weight | mother_marital_status | previous_cesarean | newborn_gender | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| mother_body_mass_index | 1.000 | 0.809 | 0.001 | -0.043 | -0.260 | 0.009 | -0.121 | 0.010 | 0.001 | 0.034 | 0.091 | 0.088 | 0.084 | 0.000 |
| mother_delivery_weight | 0.809 | 1.000 | -0.038 | 0.360 | 0.172 | 0.026 | -0.048 | 0.031 | -0.022 | 0.089 | 0.212 | 0.076 | 0.059 | 0.011 |
| mother_race | 0.001 | -0.038 | 1.000 | -0.050 | -0.051 | 0.044 | 0.105 | -0.049 | 0.049 | -0.081 | -0.135 | 0.308 | 0.018 | 0.005 |
| mother_height | -0.043 | 0.360 | -0.050 | 1.000 | 0.126 | 0.083 | 0.117 | 0.024 | -0.025 | 0.052 | 0.160 | 0.082 | 0.034 | 0.000 |
| mother_weight_gain | -0.260 | 0.172 | -0.051 | 0.126 | 1.000 | -0.024 | 0.055 | 0.022 | -0.034 | 0.088 | 0.173 | 0.095 | 0.030 | 0.027 |
| father_age | 0.009 | 0.026 | 0.044 | 0.083 | -0.024 | 1.000 | 0.279 | -0.066 | -0.039 | 0.047 | 0.033 | 0.326 | 0.082 | 0.003 |
| father_education | -0.121 | -0.048 | 0.105 | 0.117 | 0.055 | 0.279 | 1.000 | -0.026 | -0.023 | 0.010 | -0.010 | 0.553 | 0.022 | 0.003 |
| cigarettes_before_pregnancy | 0.010 | 0.031 | -0.049 | 0.024 | 0.022 | -0.066 | -0.026 | 1.000 | 0.049 | -0.057 | -0.079 | 0.163 | 0.011 | 0.000 |
| prenatal_care_month | 0.001 | -0.022 | 0.049 | -0.025 | -0.034 | -0.039 | -0.023 | 0.049 | 1.000 | -0.324 | -0.010 | 0.021 | 0.020 | 0.000 |
| number_prenatal_visits | 0.034 | 0.089 | -0.081 | 0.052 | 0.088 | 0.047 | 0.010 | -0.057 | -0.324 | 1.000 | 0.149 | 0.136 | 0.019 | 0.008 |
| newborn_weight | 0.091 | 0.212 | -0.135 | 0.160 | 0.173 | 0.033 | -0.010 | -0.079 | -0.010 | 0.149 | 1.000 | 0.113 | 0.024 | 0.109 |
| mother_marital_status | 0.088 | 0.076 | 0.308 | 0.082 | 0.095 | 0.326 | 0.553 | 0.163 | 0.021 | 0.136 | 0.113 | 1.000 | 0.032 | 0.002 |
| previous_cesarean | 0.084 | 0.059 | 0.018 | 0.034 | 0.030 | 0.082 | 0.022 | 0.011 | 0.020 | 0.019 | 0.024 | 0.032 | 1.000 | 0.000 |
| newborn_gender | 0.000 | 0.011 | 0.005 | 0.000 | 0.027 | 0.003 | 0.003 | 0.000 | 0.000 | 0.008 | 0.109 | 0.002 | 0.000 | 1.000 |
| mother_body_mass_index | mother_marital_status | mother_delivery_weight | mother_race | mother_height | mother_weight_gain | father_age | father_education | cigarettes_before_pregnancy | prenatal_care_month | number_prenatal_visits | previous_cesarean | newborn_gender | newborn_weight | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 30.8 | 2.0 | 220.0 | 1 | 65.0 | 35.0 | 29.0 | 6 | 0.0 | 2 | 10.0 | N | F | 3045 |
| 1 | 45.8 | NaN | 293.0 | 1 | 64.0 | 26.0 | 37.0 | 4 | 0.0 | 3 | 10.0 | N | F | 3061 |
| 2 | NaN | 1.0 | NaN | 1 | 66.0 | NaN | 33.0 | 6 | 0.0 | 3 | NaN | N | F | 3827 |
| 3 | 24.3 | 1.0 | 157.0 | 1 | NaN | 20.0 | 27.0 | 6 | 0.0 | 3 | 9.0 | N | M | 3997 |
| 4 | 24.1 | 1.0 | 187.0 | 1 | 65.0 | 42.0 | 29.0 | 8 | 0.0 | 2 | 12.0 | N | F | 3240 |
| 5 | 30.9 | 2.0 | 231.0 | 1 | NaN | 51.0 | 27.0 | 3 | 0.0 | 4 | 10.0 | N | M | 3544 |
| 6 | 22.9 | 1.0 | 141.0 | 1 | NaN | 16.0 | 33.0 | 3 | 0.0 | 4 | 8.0 | N | M | 3010 |
| 7 | 28.3 | NaN | 182.0 | 1 | 65.0 | 12.0 | NaN | 5 | 11.0 | 3 | 15.0 | N | M | 3856 |
| 8 | 40.7 | 1.0 | NaN | 1 | 63.0 | NaN | 37.0 | 4 | 0.0 | 3 | 9.0 | Y | M | 1015 |
| 9 | 36.3 | 1.0 | 274.0 | 1 | 71.0 | 14.0 | 33.0 | 3 | 0.0 | 2 | 14.0 | N | F | 4450 |
| mother_body_mass_index | mother_marital_status | mother_delivery_weight | mother_race | mother_height | mother_weight_gain | father_age | father_education | cigarettes_before_pregnancy | prenatal_care_month | number_prenatal_visits | previous_cesarean | newborn_gender | newborn_weight | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2398106 | 31.2 | NaN | 172.0 | 1 | 60.0 | 12.0 | 25.0 | 1 | 0.0 | 2 | 15.0 | N | F | 2745 |
| 2398107 | 22.9 | 2.0 | 157.0 | 2 | 62.0 | 32.0 | 35.0 | 6 | 0.0 | 2 | 12.0 | N | F | 3080 |
| 2398108 | 38.6 | 1.0 | 253.0 | 4 | 64.0 | 28.0 | 35.0 | 7 | 0.0 | 3 | 12.0 | Y | M | 3997 |
| 2398109 | 23.3 | 1.0 | 170.0 | 4 | 65.0 | 30.0 | 34.0 | 8 | 0.0 | 2 | 7.0 | N | F | 3165 |
| 2398110 | NaN | 2.0 | 217.0 | 1 | 65.0 | 71.0 | 36.0 | 4 | 0.0 | 2 | 13.0 | N | M | 3986 |
| 2398111 | 22.1 | 1.0 | 152.0 | 1 | 63.0 | 27.0 | NaN | 4 | 0.0 | 4 | 5.0 | N | M | 3015 |
| 2398112 | 34.0 | 2.0 | 260.0 | 2 | 71.0 | 16.0 | 33.0 | 3 | 0.0 | 1 | 13.0 | N | M | 3572 |
| 2398113 | 24.6 | 1.0 | 157.0 | 1 | NaN | 18.0 | 26.0 | 4 | 0.0 | 3 | 15.0 | N | F | 3299 |
| 2398114 | 26.1 | NaN | 185.0 | 1 | 61.0 | 47.0 | 31.0 | 1 | 0.0 | 2 | 15.0 | N | M | 3062 |
| 2398115 | 23.0 | 1.0 | 172.0 | 4 | 63.0 | 42.0 | 43.0 | 6 | 0.0 | 5 | 9.0 | Y | M | 3660 |
Most frequently occurring
| mother_body_mass_index | mother_marital_status | mother_delivery_weight | mother_race | mother_height | mother_weight_gain | father_age | father_education | cigarettes_before_pregnancy | prenatal_care_month | number_prenatal_visits | previous_cesarean | newborn_gender | newborn_weight | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 353 | NaN | NaN | NaN | 4 | NaN | NaN | NaN | 9 | NaN | 99 | NaN | N | F | 3500 | 3 |
| 0 | 15.2 | 2.0 | 164.0 | 1 | 68.0 | 64.0 | NaN | 9 | 0.0 | 3 | 12.0 | N | M | 1503 | 2 |
| 1 | 17.4 | 1.0 | 137.0 | 1 | 62.0 | 42.0 | 30.0 | 6 | 0.0 | 2 | 40.0 | Y | M | 2150 | 2 |
| 2 | 17.6 | 2.0 | 122.0 | 1 | 60.0 | 32.0 | 18.0 | 2 | 0.0 | 5 | 7.0 | N | F | 2438 | 2 |
| 3 | 17.9 | 1.0 | 162.0 | 1 | 68.0 | 44.0 | 37.0 | 6 | 0.0 | 2 | 15.0 | N | F | 2920 | 2 |
| 4 | 18.0 | 1.0 | 152.0 | 1 | 67.0 | 37.0 | 26.0 | 4 | 0.0 | 3 | 9.0 | N | F | 2466 | 2 |
| 5 | 18.0 | 2.0 | 129.0 | 1 | 60.0 | 37.0 | 29.0 | 3 | 15.0 | 3 | 14.0 | N | M | 2010 | 2 |
| 6 | 18.2 | 1.0 | 170.0 | 1 | 68.0 | 50.0 | 34.0 | 8 | 0.0 | 3 | 12.0 | N | M | 2670 | 2 |
| 7 | 18.3 | 2.0 | 135.0 | 6 | 62.0 | 35.0 | NaN | 9 | 0.0 | 2 | 2.0 | N | M | 1418 | 2 |
| 8 | 18.3 | 2.0 | 138.0 | 2 | 65.0 | 28.0 | NaN | 9 | 0.0 | 3 | 12.0 | N | M | 3385 | 2 |